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Abstract. Cellular automata (CA) are dynamical systems denned by a finite local rule 
but they are studied for their global dynamics. They can exhibit a wide range of complex 
behaviours and a celebrated result is the existence of (intrinsically) universal CA, that 
is CA able to fully simulate any other CA. In this paper, we show that the asymptotic 
density of universal cellular automata is 1 in several families of CA defined by local sym- 
metries. We extend results previously established for captive cellular automata in two 
significant ways. First, our results apply to well-known families of CA (e.g. the family 
of outer-totalistic CA containing the Game of Life) and, second, we obtain such density 
results with both increasing number of states and increasing neighbourhood. Moreover, 
thanks to universality-preserving encodings, we show that the universality problem remains 
undecidable in some of those families. 



Introduction and definitions 

The model of cellular automata (CA) is often chosen as a theoretical framework to study 
questions raised by the field of complex systems. Indeed, despite their formal simplicity, 
they exhibit a wide range of complexity attributes, from deterministic chaos behaviours 
(e.g. |3j) to undecidability in their very first dynamical properties (e.g. [2]). One of their 
most important feature is the existence of universal CA. Universality in CA is sometimes 
defined by an adaptation from the model of Turing machines and sequential calculus. But 
a stronger notion, intrinsic to the model of CA, has emerged in the literature [7J: a CA is 
intrinsically universal if it is able to fully simulate the behaviour of any other CA (even on 
infinite configurations). 

Besides, when it comes to modelling [Ij or experimental studies \10\ 111) , most works 
focus on some particular syntactical families (elementary CA, totalistic CA, etc), either 
to reduce the size of the rule space to explore, or to match hypothesis of the studied 
phenomenon at microscopic level (e.g. isotropy). 
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In a word, CA are known for their general ability to produce complex global behaviours, 
but local rule considered in practice are often very constrained. This paper studies the link 
between syntactical restriction on CA local rules and typical global behaviours obtained. It 
establishes a probabilistic result: for various symmetry criterions over local rules, randomly 
choosing a local rule within the symmetric ones yields almost surely universal CA. Mean- 
while, the universality problem is shown to remain undecidable even restricted to symmetric 
rules (for some of the symmetry criterions). 

A family of CA defined by a simple syntactical constraint (namely captive CA) and 
containing almost only universal CA has already been proposed by one of the authors [9], 
but the present paper goes further. First, it generalises the probabilistic framework: the 
neighbourhood of CA is no longer fixed as it was needed in [9j. Second, it considers well- 
known families of CA (e.g. totalistic or outer-totalistic CA) and generalisations of them, 
namely multiset CA, which are meaningful for modelling (they are 'isotropic' CA). 

After having recalled standard definitions about CA (end of this section), section Q] 
presents the families considered in this paper. Then, section [2] defines intrinsic universality 
and the simulation relation involved in that notion. Section [3] gives the probabilistic setting 
of the paper and establishes the main probabilistic results. Finally, section O is dedicated 
to existence proofs of universal CA in the families considered. Combined with probabilistic 
results, it proves that almost all CA are universal in those families. 

Definitions and notations. In this paper, we adopt the setting of one-dimensional cellular 
automata. Formally, a CA is a 3-uple A = (n, k, 8^) where n and k are positive integers, re- 
spectively the size of the state set Q n = {1, . . . , n} and of the neighbourhood [[— L^pJ ; [_f J 1 
$A '■ Q n — > Qn is the local transition function. 

A coloring of the lattice Z with states from Q n (i.e. an element of Q^) is called a 
configuration. To A we associate a global function Gj^ acting on configurations by synchro- 
nous and uniform application of the local transition function. Formally, G_a '■ Qn ~> Qn is 
defined by: G^(x) z = S^(x z _,k-x j ; • • • i x z +\_h.y) f° r an x £ Qn an d z € Z. 

The local function 5 a naturally extends to Q n , the set of finite words over alphabet Q n 
(with 5_a(u) being the empty word if \u\ < k). For p € N, this function maps an element of 
Qn +k to an element of Qn +1 - 

The size of A = (n, k, 5 X) is the pair (n, k). The set of all CA is denoted by CA, and 
the set of all CA of size (n,k) by CA n ^. Moreover for any set T C CA, T Ui k is defined 
by J~ n ,k = J~ H CA n fc. Formally a CA is a 3-uple but, to simplify notation, we sometimes 
consider that T n ,k is a set of local functions of type — ► Q n - 

This paper will intensively use (finite) multisets. A multiset M of elements from a set E 
is denoted by M = {{(ei, m ),..., (e p , n p )}} where a pair (ej, rtj) E E x N denotes an element 
and its multiplicity. The cardinality of M is \M\ = X]j n «- The cardinality notation is the 
same for sets. 

1. Families of CA with Local Symmetries 

In this section, we define various families of CA characterised by some local symmetry. 
'Symmetry' must be taken in a broad sense since it may concern various aspects of the local 
function. We first consider families where the local function does not depend on the exact 
configuration of the neighbourhood (a A:-uple of states) but only on a limited amount of 
information extracted from this configuration. 
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MultiSet CA. Multiset cellular automata are cellular automata with a local rule invariant 
by permutation of neighbours. Equivalently, they are CA whose local function depends only 
on the multiset of states present in the neighbourhood. Formally, A € CA n:k is multiset, 
denoted by A € MS n fc, if for any permutation tt of {1 ... k}, the local function S A satisfies 

Vai, . . . ,a k £ Q n : 5 A {ai, ...,a k ) = S^a^, . . . ,a n ^). 

Set CA. Set CA are a special case of multiset CA: they are CA whose local function 
depends only on the set of states present in the neighbourhood. Formally, A € CAn^ with 
arity k is a set CA, denoted by A <E Set Hj fc, if 

W,v G : . ,u k } = {vi, . .. ,v k } S A (u) = 5 A (v). 

Note that for fixed n, there is a constant N such that, for all k, \Set nk \ < -/V. Thus there 
is no hope that the asymptotic density of a non-trivial property for fixed n be 1 for family 
Set. 

Totalistic CA. Totalistic CA are also a special case of Multiset CA: they are CA whose 
local functions depends only on the sum of the neighbouring states. Formally, A £ CA n ^ 
k is totalistic, denoted by A € Tot^, if 

k k 

Vu,v E Q*:^2ui = ^2vi^ 5 A {u) = 5 A (v). 
i=i i=i 

Partial Symmetries. We can consider weaker forms of each family above, by excluding 
some neighbours from the 'symmetry' constraint and treating them as a full dependency 
in the local function. For instance, we define the set of outer-multiset CA as those with a 
local rule depending arbitrarily on a small central part of their neighbourhood and on the 
multiset of other neighbouring states. Formally, for any k', < k! < k, O k 'M.S ntk is the set 
of CA with n states, arity k and such that for any permutation w of {1 . . . k — k'} and any 
01, ... , a k -k',h, ■ ■ ■ , h' G Qn we have: 

5 A (ai, . . . ,a^ k _ k ,y 2 },h, ■ ■ ■ ,h',ai(k-k')/2\+i, ■ ■ ■ , a k~k>) 

= ^( a 7r(l)> • • ■ > a 7r(L(fc-fc')/2j)'frl' • • • A' 5 «7r(L(fc-fc')/2j+l)> • • • , a n(k-k'))- 

We define in a similar way outer-totalistic and outer-set, and denote them by CvTot nj fc 
and O k /Set n k respectively. Note that what is classically called outer-totalistic is exactly 
the family OiTot n k . 

State symmetric CA. Families above are variations around the invariance by permu- 
tations of neighbours. State symmetric CA are CA with a local function invariant by 
permutation of the state set. Formally, a CA A G CA 1ljk is state symmetric, denoted by 
A £ SS n>k , if for any permutation ir of Q n we have: 

Vai, ...,a k : 5 A (ai, ...,a k ) = ir^ 1 (5 A (n(ai), . . .,ir(a k ))). 

Note that we have a situation similar to the case of Set: for fixed k, there is a constant K 
such that, for all n, |SS n> fe| < K. Thus their is no hope that the asymptotic density of a 
non-trivial property for fixed k be 1 in state symmetric CA. 
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Captive CA. Finally, we consider the family of captive CA already introduced in [8]: they 
are CA where the local function is constrained to produce only states already present in 
the neighbourhood. Formally, a CA A £ C A n j, is captive, denoted by A £ K^, if: 

Vai, . . . ,a k : £4(01, • • • & {>i, • • 
The following lemma shows a strong relationship between captive and state symmetric CA. 

Lemma 1.1. Let n and k be such that 1 < k < n — 2. Then we have SS n ,k ^ K n ±. 

Combining symmetries. In the sequel, we will often consider intersections of two of the 
families above. Note that all intersections are generally non-trivial. However, for the case 
of Tot nj fc and K n) fc, the intersection is empty as soon as there exists two fc-uple of states 
with disjoint support but with the same sum, because the 'captive' constraint forces the two 
corresponding transitions to be different whereas the 'totalistic' constraint forces them to be 
equal. This happens for instance when n > 3 and k is even with A;-uples (1, 3, 1, 3, . . . , 1, 3) 
and (2, 2, 2,..., 2). 

2. Simulations and Universality 

The property we are mostly interested in is intrinsic universality (see [7] for a survey 
on universality). To formalize it, we first define a notion of simulation. 

A CA A is a sub- automaton of a CA B, denoted A C J3, if there is an injective map ip 
from A to B such that 7p o Gjs, = Gq o Tp, where Tp : A 1, — ► B 1, denotes the uniform extension 
of ip to configurations. We sometimes write A B to make tp explicit. This definition is 
standard but yields to a very limited notion of simulation: a given CA can only admit a 
finite set of (non-isomorphic) CA as sub-automata. Therefore, following works of J. Mazoyer 
and I. Rapaport [I] and later N. Ollinger [51 E], we will consider the following notion of 
simulation: a CA A simulates an AC B if some reseating of A is a sub-automaton of some 
rescaling of B. The ingredients of the rescalings are simple: packing cells into blocs, iterating 
the rule and composing with a translation (formally, we use shift CA a z , z £ Z, whose global 
rule is given by <j{c) x = c x - z for all x £ Z). Formally, given any state set Q and any m > 1, 



Vz £ Z : [b m (c)j (z) = (c(mz), . . . , c(mz + m — 1)) 

for all c £ Q z . The rescaling _4< m >M> Q f A by parameters m (packing), t > 1 (iterating) 
and z £ Z (shifting) is the CA of state set Q m and global rule: 



With these definitions, we say that A simulates B, denoted B ^ A, if there are rescaling 
parameters mi, TO2, ii, £21 #1 an d 22 such that ; g< m i.*i. z i> □ _4<m2,*2,22>_ j n se quel, we 
will discuss supports of simulations, i.e. sets of configurations on which simulations occur, 
jf j g<mi,t 1 ,z 1 > j(<m 2 ,t2,z2> ^ ^ e SU pp 0r t of the simulation is the set of configuration of A 

defined by oTpo 6 mi (Q|). It is a subshift: a closed shift-invariant set of configurations. 
In the sequel we denote by B =4x A the fact that A simulates B on support X. 

Once formalised the notion of simulation, we naturally get a notion of universality: 
CA able to simulate any other CA, denoted A £ hi. This notion associated to ==! is called 
intrinsic universality in the literature (see [7]). Actually, an intrinsically universal CA A 




b m o a z o G\ o b. 
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has the following stronger property (see 0[5]): for all B, there are rescaling parameters m, 
t and z such that B E A <m ^ z> . 



3. Asymptotic Density and Monotone Properties 



3.1. Asymptotic density 

When considering a property V and a family T (two sets of CA), we can define the 

I J~ rrPl 

probability of V in T n k by p n i. = i ~ . . Our probabilistic framework consists in taking 

K n,fc I 

the limit of this probability p n k when the "size" (n and/or k) of the automata grows toward 
infinity. In [9], only a particular case was considered: k fixed, and n — > oo. The following 
definition consider all possible enumerations of 'size' through the notion of path. 

Definition 3.1. A path is an injective function p : N — > N 2 . When the limit exists, we 
define the asymptotic density of V in T following a path p by 

The family of possible paths is huge and two different paths do not always define 
different densities. 

We denote N co = N \ {0, 1, . . . , Co — 1}. Since we consider asymptotics, we can restrain 
to paths p : N — > N no x r% without loss of generality. 

In the following, we will obtain limit densities of value 1, which justifies the use of 
non-cumulative density : in our case a density 1 following a given path implies a cumulative 
limit density 1 along this path. 



3.2. Density of monotone properties among symmetric family 

A property V is said to be increasing with respect to simulation if \/A € V, A =4 & 
implies B € V. Decreasing properties are defined analogously. In this section we prove that 
monotone properties have density or 1 among symmetric families introduced in section 
[1] following particular paths. More precisely, we are going to show that any non-trivial 
increasing property has density 1. 

. For any local function / : — > Q n , for any set E Q Q^, we denote by $\e the restriction 
of / to E. We also denote J^n^E = {/Is : / £ ^n.fe}- Let {Ei}i £ i be a finite family of 
subsets of and denote E = U^jEi. We say that the family {Ei}i e i is independent for 
T if the map 

i€l 

defined by ip(f) = (f\ Q k\ E , f\ El , ■ ■ ■ , f\E t , ■ • • ) is a bijection (it is always injective). 

By extension, we say that a collection of subshifts {Aj}j g / is independent if the family 
{E(Xi)}i e j is independent, where E(X{) C is the set of words of length k occurring in 
•Y„ 

Let S Ao = {A G CA : A 4 A} and S AojX = {A e CA : A 4x A}. 
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Lemma 3.2. Let J- C CA, and Ao £ J~no,ko a given CA. For any size (n,k) (n > no, 
k > ko) and any collection of subshifts {XA^j, we denote a.{ = ^ n ' k< ^ A o> x i j or a \\ ^ jj 

K n,k | 

{Xi}i e j is independent for T , then 

^^ .>i-na-^) 

Proof. We use the notations above. As the property *4o =^X; *4 is only determined by 
the restriction of A to E(Xi), there exists Aj C J^^l^;. such that ^{T n ,k D 5^ ,xJ = 
•^n.fclofcVE x -^n.feUi x • • • x ^fel^ xiiX J r n ,fc|js <+1 • • • • And as the family {-Ei}j e j is 
independent for T , ib is bijective and oti = <J Al j — r. 

K n , k | | 

By definition of we have the following inclusion: \J ieI (J 7 n ,k^<S_A 0t x i ) Q (^,^5^). 
To conclude, it is enough to use the fact that tp is bijective in order to express the size of 
these sets' complement in T n ^. 



3.2.1. Increasing n, fixed k. 

Proposition 3.3. In the following, E is chosen among CA, MS, Set, Ok'Set, Ok'MS. 

For any Ao £ En K no ^ Q , for all e, there exists n t) k such that if n > n e ^ 

\{EnK nM )C\s Ao \ > 1 _ £ 
\En K nj k \ 

Thus, any increasing property V such that 3Ao £ E n K n / t n V has density 1 in family 
EflK for paths with fixed k. The case E = CA was already proved in [9]. 

Proof. Let {Aj} ig [[ j be a collection of fullshifts on disjoints alphabets of size n . They 

are independent for family EnK, whatever the choice of E. Because of captivity constraint, 

the simulation happens on Xi with probability Qj „ fc > co = l/ra ° . We obtain by lemma 



3.2.2. Increasing n, fixed k. In the following, we use lemma [3T2l with an increasing number 
/ = 0{k) of independent simulation subshifts, each providing the desired property for a 
constant fraction d n of J- n /% (n is fixed). It gives ^ n '^ SA o > i _ n _ ^ V anc [ we OD tain a 
limit density dk^(S_A ) = 1. 

Multiset CA. 

Proposition 3.4. For all Ao £ MS n(h k , for all e > 0, for all n > no + 2, i/iere exists k e 
such that for all k > k f , ^ MS r^ nS f ^ > 1 — e. 

■> t; \Mb n ,k\ 

Proof. We consider a multiset CA Ao £ MS nQj fc , a size n > no + 2/co + 4, and a given e > 0. 
In order to clarify the construction we denote the 2 biggest states of Q n by Oo and lo- For 
any size k, we define I = [ jgjjjj j and o = k — l.ko- And for any j £ \ko + 1; / — &o — 11 -Wj 

is the word Ma = di - "' • In- 
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We define the simulating subshift Xj as the set of configurations alternating a state of 
Q no and a pattern Mj. The family {Xj}j is independent for multiset CA. On every such 
subshift, the simulation will happen if the CA maintains the structure (eventually shifted) 
and computes steps of Aq. Multisets corresponding to patterns of length k occurring in Xj 
are: 

• ^j',{(ll,l),(l2,l)i-,(% ,1)} = 

{{(Oo, (k - '+ o), (1 , (k - (xi, 1); (x 2 , 1), . . . , (x ko ,l)}} 

• For < s < o, Wj S)fco _ 1 = 

{{(0 , (k - + o + 1 - a), (1 , (fe - 1).(Z - j) + s), (xi, 1), (x 2 , 1),..., (x ko _ u 1)}} 

• w lk -i = {{(Oo, (fco-l)-i), (lo, (ko-l).(l-j)+o+l), (xi,l), (x 2 , 1), . . . , (x fc0 _!, 1)}} 

• For < s < o - 1, Wj[ aJko = 

{{(0 , (fcb - + s), (1 , (fcb - - j) + o - s), {x x , 1), (x 2 , 1), . . . , (x fco , 1)}} 
«4o is simulated on support Xj if we have the following: 

• M^,{(:ziT),Oz2,l),...,Orfc ,l)}) = ^({{(^l, 1), 0*2, 1), ••■(Xfc , 1)}}) 

• ^(^>,fe -l) = °0 with < S < O 

• ^(^ fe0 -i) = ^(W/U) = lo with < s < o - 1 

The number of involved legal multiset transitions for a given subshift Xj is less than 
(2.fco + 1).71q . Thus, the proportion of CA in MS nj fc simulating Aq on Xj is at least 
]y n (2.fc +i)-n w j 1 j c j 1 j s constant with increasing k. And the number of such possible subshift 
is I = 0(k). We conclude with lemma [3721 as explained before. ■ 

Totalistic CA. We manage to make the multiset construction above to become totalistic. 
To do it, we define the mapping ipj by: Vx G Qn , <fij{x) = (x(no + 1)) • O^- 7 • 1^, with Oo = 
and lo = rao(no + 1) + 1. The j-th subshift is defined as the smallest subshift containing 

{fj(Qn)) ■ The transitions are distinguishable by the number of lo, and the number of 
states smaller than no (no + 1) in any legal neighbourhood. The probability to simulate the 
original CA on the j-th subshift is constant, and the simulating subshifts are independent 
for totalistic CA. As the number of possible simulation increases, the limit probability for 
any CA to simulate a given CA is increasing to 1. 

Outer-multiset CA. We still consider the same possible simulations of any multiset CA 
Aq € CA„ 0ifco by a CA O fe /MS n ,&. 

As A is only partially multiset, the number of transitions involved in a simulation on 
one given subshift has increased: we have to consider the transitions with every possible 
central pattern of size k'. Using a precise account, we ensure that the number of transitions 
involved in one given simulation is bounded by c k with c only depending on no and kQ. And 
the number of possible subshifts for the simulation to happen is the same as in the totally 

multiset case: it is still given by I k/2 1 — 1. We obtain l°fc'^ [ ^".fc n ' s ^u > 1— ( 1 1 — _ ) 

6 |O fe ,MS„, fc | ^ (n +2) cfc J 

with I = 0(k). To ensure that c4,o fc /MS n k (^A ) = 1 it is enough to suppose that k' = 
o{log{log{k)). 



3.2.3. More general paths. 
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Multiset captive CA. We prove a slightly more general result with the family of multiset 
captive CA KMS defined by KMS = K n MS. 

Proposition 3.5. For any path p : N — > N 2 such that the lower limit of x \— > n = Tti(p(x)) 
is infinite, and for any Aq G KMS no ^ , for all e, there exists s e such that if x > s e then 

\s Ao n KMS p{x) \ 

\KMS p[x) \ > 6 

Proof. The collection of subshifts, and the simulation behaviour are exactly the same as in 
the multiset case. If Ao is captive, each simulating transition is also captive. The number of 
involved transitions is the same as in the MS case: (2.ko + l).ng°. But using the captivity 
constraint, the probability for the simulation on the j-th. subshift to happen is also bounded 
by : l/(2.fc + l).ng°. We use the fact that the number of possible simulations is still 0(k) 
to conclude using lemma 13.21 



Set captive CA. 

Proposition 3.6. For any path p : N — > N 2 such that the lower limit of x — > n = Tti(p(x)) 
is infinite, and for any Ao G KSet m ^ , for all e, there exists s e such that if x > s e then 

\SAa n K S e tpath(x)\ 

i 1 - > 1 — e 

\KSetp ath ( x }\ 

Proof. Given Aq, n, and k big enough, we denote the 2ko + 4 first states of Q n by 0j 
and lj, i G [l;A;o + 2]]. The j-th. subshift is the set of configurations alternating words 
0°l^ _o (with I = L^~i J an d o = k — l.ko) legally ordered and simulating states taken from 
Sj = [[2/co + 4 + j.no; 2k^ + 4 + j.no + uq — lj. Legal set transitions for this subshift are 

• • • • ,ak } U {0j, lj, . . . ,0 i+fc -i, lj+fc-i, Oj+fc } — > o~A {{ a ii ■ ■ ■ > a fc}) 

• {czi, . . . , flfco+e} U { If— i , 0j, lj, . . . , Oj+fe-i, lj+fc-i, Oj+fc } — > lj+fc/2 with e € {0, -1} 

• {oi, . . . ,a fco+e }U{0i, li,0j+i, lj+i, . . . , l i+ k-i,0i+k, h+k } -> i+fc /2 with e G {0, -1} 
With indicies modulo A; + 2, and a x G "Ej for all a;. For all i those transitions may be 
identified by a set CA using the underlined state. 

. So we need uq + 2.(ko + 2) different states to make the simulation on this subshift and 
the number of involved transitions is equal to 3.(ko + 2). Thus, because of captivity, the 
proportion p of CA in which one given simulation happens is constant when k, or n is 
increasing. And the family of the [ n ~ 2 ^ 0+2 - ) j possible simulation subshifts is independent. 

With lemma [3721 we obtain the inequality "jxset i~H > 1 — (1 — p) ™° • We conclude 
the proof using the hypothesis on the path, lim x _+^.,n = lim, 1 ._ 00 TT-\ (x) = oo. ■ 
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4. Encodings 

In the following we prove that there exists universal cellular automata in most of the 
families defined above. This is an important step considering the fact that some well known 
locally defined family, such as LR-permutative CA, do not contain any universal CA (be- 
cause intrinsic universality implies non-surjectivity, see [5]). In fact, for every given family 
T , we introduce an encoding map <pj? : CA —* T such that for any A, its corresponding 
encoded version ipjr{A) verifies A =4 <Pf(A). The existence of a universal CA in T follows 
by application of the encoding to any universal CA. Moreover, in some cases, we obtain a 
stronger result: the encoded CA is universal if and only if the original CA is universal. 

Set CA. Given a CA A G CA nj fc of state set Q n , we construct ^(A) G Set with state set 
Q = Q n x {0, . . . , k + 1} U {#} of size n.(k + 2) + 1. 

A configuration c G Q z is said legal if c(z) ^ # for all z and if the projection of c on 
the second component of states (which is well-defined) is periodic of period 1 • 2 ■ • • (k + 2). 
Thus, for any legal configuration c and any position z, the set of states of cells which are 
neighbours of z is of the form: 

Ei(a\, . . . , a*;) = {(aij i), (a,2,i + 1 mod k + 2), {a k , i + k — 1 mod k + 2)} 

for some i G {1, . . . , k + 2} (with a,j G Q n for all j). ^(A) is defined by the local rule / as 
follows: 

f (<U(ai, • • .,a k ),i+ [k/2\ mod k + 2) if {x x , ...,x k } = Ei(ai, . . .,a k ), 
I # else. 



f(xx, ...,x k ) 



By construction, we have ^(^4.) G Set. Moreover the encoding preserves universality. 
As a direct corollary, we get the undecidability of universality in family Set (universality 
was proven undecidable in the general case in [6]). 

Theorem 4.1. The encoding ^ : CA — > Set satisfies the following: 

(1) A ^ y{A) for all A, 

(2) A is universal if and only if^>(A) is universal. 



Captive set CA. We denote by KSet the intersection K n Set. The previous construction 
does not generally produce captive CA (even if the original CA is captive). We now describe 
a new encoding which produces only CA belonging to KSet. It could have been used to 
prove the existence of universal set CA, but we have no proof that it satisfies the second 
assertion of theorem 14.11 (hence the usefulness of previous construction) . 

The new mapping ip : CA — > KSet is an adaptation of ^f. We keep the idea of states 
being a cartesian product of the original alphabet Q n and a family of labels which is 
in this case {0, ...,2k — 2}. But, in order to have every transition satisfying the captive 
constraint, we introduce 'libraries' of states placed regularly in legal configurations: between 
two computing cells, we place the i-th library for some i, denoted by Lj, which contains the 
n states {(x, i)}xeQ n - F° r technical reasons, it also contains special states (#,i) and (#',i), 
and it is ordered as follows: Lj = (#, i) ■ (1, i) ■ (2, i) ■ ■ ■ (n, i) ■ (#', i). Thus, tp(A) has state 
set Q = {0, . . . , 2k - 2} x (Q n U {#, #'}). 
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The simulation of A by <p(A) takes place on 'legal' configurations denned by an alter- 
nation of an isolated state of label i, and a library of type k + i, precisely: 

••• (<Zi,*) ■ L fe+ j mod2 A:-i • (<?2, i + 1 mod 2k — 1) • L k+i+ i mod 2 k-i ■ • ■ 

Those legal configurations will be maintained in one-to-one correspondence with configu- 
rations of A, successive isolated states between libraries corresponding to successive states 
from A. However, this time, the simulation of 1 step of A will use 2 steps of <p(A) and only 
even time steps of <p(A) (including time 0) will produce legal configurations. For odd time 
steps, we introduce 'intermediate' configurations defined by an alternation of an isolated 
state of label i, and a library of type r + i, precisely: 

••• (<Zi>*) ' mo d 2fc-i • (<?2, i + 1 mod 2k — 1) • L r+i+1 mo( j 2 k-i • • ■ 

where r = [k/2\ is the radius of A. 

To describe the local rule of <p(A), we introduce the following sets: 

• Vi(a±, . . . , afc) = {(ai,i), (a 2 ,i + 1 mod 2k — 1) . . . (a^, z + — 1 mod 2/c — 1)}; 

• Lj is the set of states present in the word Lj; 

• Bi :X = (1, i), (6 — l,i)} is the set of states in the prefix of of Lj of length 

x; 

• = {(e, i), (n, z), (#', ?)} is the set of states in the suffix of Lj of length 
n — x + 1. 

(/?(«4) has arity k' = k + (k — 1) • (n + 2) and, on legal configurations, the set of states 
seen in a neighbourhood has one of the following types: 

Tl: Vi(ai, . . . , Ofc) U Li + k mod 2k-l U . . . U Li + 2k-2 mod 2k-l'i 
T2: Vi(ai, . . . , a k ) U E i+ k-l mod 2k-l,x U -^i+A: mod 2fe-l U . . . 
. . . U L i+ 2k-3 mod 2fc-l U B i+2 k-2 mod 2k-l,x- 

On intermediate configurations, the set of states seen in a neighbourhood has one of 
the following types: 

T3: Vi(ai, . . . , a k ) U Lj_ r mo ^ 2 fe-i U . . . U -^i-r+fe-2 mod 2fc-i! 
T4: Vj(ai, . . . , a^) U -Ej_ r _i mo d 2fc-2,a; U Lj_ r mo ^ 2 fc-i U . . . 

. . . U mod 2fc-l U B i _ r+k _ 2 mod 2k-l,x- 

An important point is that the 4 types are disjoint: it is obvious that each of Tl and 
T3 is disjoint from each of T2 and T4, and the overall disjointness follows from the fact 
that sets of type T3 and T4 have less elements than Tl and T2 since set Li are disjoint but 

Vi(ai, . . . ,a k ) n Lj ^ <^> i < j < i + k - 1 

Using notations above, the behaviour of p(A) is defined by 4 kinds of transitions ac- 
cording to the kind of neighbourhood seen: 

Tl: -» (fo(ai,...a fc ),i + 2fc-2mod2A;-:L) 
T2: —> (x,i + k — l mod 2k - 1) 
T3: -> (a 1+r ,i + r mod 2/c - 1) 
T4: -> + r) 

The crucial point for transition of type T3 to be well-defined is that ai+ r can be unam- 
biguously determined given that the libraries present have labels ranging from i — r — 1 to 
i — r + k — 2 = i + r — 1 whereas a\ +r is associated to label i + r in Vi(a\, . . . , a k ) (every- 
thing is taken modulo 2k — 1). 
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Intuitively, type Tl corresponds to simulation of transitions of A and the three other 
types are devoted to the modification of label of isolated states or the displacement of 
libraries according to the following scheme: 

• At even steps, transitions of type Tl apply the local rule 5 a, but the result receive a 
label j such that Lj is present in the neighbourhood to satisfy captivity constraint; 
meanwhile, transitions of type T2 just shift the libraries. 

• At odd steps, the difference of labels between libraries and isolated states is wrong; 
to come back to a legal configuration, transitions of type T3 leave isolated states 
unchanged while transitions of type T4 shift the libraries. 

To completely define <p(A), we fix some ordering on Q and specify that, when the set E 
of neighbours doesn't correspond to any of the 4 types above, the local rule of ip(A) simply 
chooses the greatest state in E. With that definition, cp(A) always belong to KSet, because 
it depends only on the set of states in the neighbourhood, and because each transition 
produces a state already present in the neighbourhood (either the neighbourhood contains 
Li for the right value of i, or the local rule simply chooses the greatest element). 

Theorem 4.2. For any A, we have A ^ <p(A). Therefore families MS, KMS, Set and 
KSet contain universal CA. 

The construction above corresponds to the strongest symmetry constraint (captive set 
CA), put aside totalistic CA. The existence of (intrinsically) universal totalistic CA is proven 
in [5]. The case of outer-totalistic CA follows by inclusion. 

5. Universality Everywhere 

Gathering the density results of section [3] and the existential results for universality in 
section 01 we obtain an asymptotic density 1 for universality in the following classes. 



Family T 


Condition on the path p 


Comments 


Captive CA 


3k 


s.t. 


p(x) = (x,l 


so) 


Already in [9 




Multiset CA 


3n 


s.t. 


p(x) = (n 


x) 








fc'-outer-multiset 


3n 


s.t. 


p(x) = (n 


x) 


k' = o(lo| 




jfc)) 


Totalistic CA 


3n 


s.t. 


p(x) = (n 


x) 








fc'-outer-totalistic 


3n 


s.t. 


p(x) = (n 


x) 


k! = o(lo| 






Set captive CA 


limj 




7riO(aO) = 


+oo 








Multiset captive CA 


lim 2 




7riO(aO) = 


+oo 









6. Open Problems and Future Work 

As summarised in the previous section, our work establishes that universality has as- 
ymptotic density 1 along path p in several families defined by local symmetries, provided p 
verifies some hypothesis depending on the family considered. 

Notably, we leave open the question of the density of universality in the following cases: 

• increasing state set for families MS, Set, Tot (and outer- versions), 

• increasing neighbourhood for family K. 
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We have no result (and no intuition) concerning the case of the whole set of CA either. 
A possible progress on that topic could be to reduce the density problem of a family T\ to 
the density problem of a family J-2, i.e. to show that the densities (if they exist) in the two 
families are equal up to non-trivial multiplicative constants. 

Another perspective, especially for multiset CA (or sub-families Set and Tot), is to 
extend our result to higher dimensions or even to more general lattice of cells. Indeed, the 
symmetry involved here implies isotropy which is an often required property in modelling. 

Finally, it remains to study typical dynamics obtained in each family from random 
initial configuration. Experiments suggest that self-organisation in those families is far 
more frequent than in CA in general. 
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